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Abstract: The notion of a surface-order specific entropy h c (P) of a two-dimensional discrete random 
field P along a curve c is introduced as the limit of rescaled entropies along lattice approximations 
of the blowups of c. Existence is shown by proving a corresponding Shannon-McMillan theorem. We 



obtain a representation of h c (P) as a mixture of specific entropies along the tangent lines of c. As 
an application, the specific entropy along curves is used to refine Follmer and Ort's lower bound for 

i > ■ 

q , the large deviations of the empirical field of an attractive Gibbs measure from its ergodic behaviour 

\ in the phase-transition regime. 

o\ 

(N 

Oh ■ 1 Introduction 

The entropy of a stationary random field P indexed by a lattice is usually defined as a limit of 

entropies on an increasing sequence of boxes, rescaled by the volume of the boxes. Shannon-McMillan 

theorems describe this convergence on a deeper level, as L 1 -convergence of rescaled information 

quantities. In the context of large deviations for Gibbs measures the volume-order entropies may 

not provide enough information when a phase transition occurs. Instead, Follmer and Ort [9 

introduced the concept of surface-order entropy on boxes, proved a corresponding version of the 

£\j ■ Shannon-McMillan theorem and used it to estimate large-deviation probabilities. The construction 

. of the Wulff shape by Dobrushin, Kotecky, and Shlosman [4j suggests that such estimates can be 

' improved if boxes are replaced by more general shapes. 

!> . 

We investigate the problem of constructing entropies on general surfaces, proving appropriate 
versions of the Shannon-McMillan theorem, and using these constructions to refine the large devi- 
ation lower bound. We carry out this program in the two-dimensional case, where surfaces reduce 
to contour curves. The existence of surface-order specific entropy does not simply follow from a 
subadditivity argument. Instead, we prove a corresponding Shannon-McMillan theorem, that is, 
L 1 (P)-convergence of rescaled information quantities along lattice approximations of the successive 
blowups of the curve c. This is accomplished in three steps. The first main result is a Shannon- 
McMillan theorem along lines; cf. Theorem 13.51 for rational slopes and Theorem 13.61 for irrational 
slopes. The proof relies on uniform convergence in ergodic theorems for a suitable skew product 
transformation. The second step is to extend the results to polygons under the assumption of a 
strong 0-1 law on P. Finally, we obtain an explicit formula for the specific entropy h c (P) as a mix- 
ture of the conditional entropy of the random field restricted to the origin, given suitably defined 
pasi-cx-algebras along tangent lines of the curve (Theorem 14. 3p . Under certain conditions, this con- 
struction can be extended to relative entropies of one random field with respect to another. This 
will be the key to our proof of a refined lower bound for large deviations in the phase transition 
regime (Theorem [ 
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The role of Shannon- McMillan theorems in the refined analysis of large deviations provided the 
original motivation for this work. It seems, however, that the study of entropies along surfaces 
may hold independent interest. This paper lays some of the groundwork for such a general theory 
of specific entropies along shapes. The key to this theory is a careful combination of probabilistic 
arguments and non-standard discrete geometrical constructions. We now explain our approach and 
our results in more detail. 

Shannon-McMillan theorems and entropy. Consider a random sequence ui of letters from 
a finite alphabet T, modelled by a probability measure P on the space Q := Y^ 1 ' 2 '"' ^ . For any finite 
n, the information provided by the first n letters can be described by the function — logP[w{! ,...,«}], 
where aJu n } denotes the restriction of co to {1, . . . , n}, and P[(?/i, . . . , y n y\ is the probability that 
the pattern (yi, . . . ,y n ) appears in the first n trials. In the classical case, when the letters are 
independent and identically distributed according to a measure fi, the classical Shannon-McMillan 
theorem states that the rescaled information functions — n^ 1 \ogP[ui^i n y], converge in L (P) to 
the entropy H(fi) := ~^2 ye x fJ-(y)^og/i(y) of the measure /i. The theorem can be extended to a 
general ergodic sequence. In the bilateral case, when P is an ergodic measure on T z , the limiting 
quantity takes the form h(P) := E\H(Pq[- \ V](uj))] , where Pq[ ■ | V\ is the distribution of the random 
field in (0,0) conditioned on the "past" c-algebra V generated by the projection of u> to the set 



These constructions can be extended to a spatial setting when the random field is given by a 
stationary probability measure P on a configuration space Y z . Thouvenot [37J and Follmer [Hj 
proved spatial extensions of Shannon and McMillan's result. The specific entropy h(P) is introduced 
as the limit of ll/^l^iJv^ (P), where V n is the set of all lattice sites in the box [— n, n) d , and Hy n (P) 
is the entropy of the measure P restricted to V n . The existence of the limit follows from the sub- 
additivity of Hy with respect to V. What is more, the corresponding Shannon-McMillan theorem 
shows L X (P) -convergence of the rescaled information functions — \V n \ 1 logPfwyJ. If P is ergodic 
we obtain the formula h{P) = P[P(p>[ • | V d }(iv))] , where V d is a a-algebra representing a spatial 
version of the "past" . More precisely, V d is generated by the projections of u) to the sites preceding 
the origin in the lexicographical ordering of 1 d . 

Surface entropy. Our goal is to derive refined versions of the Shannon-McMillan theorem, where 
the information functions are observed along surfaces. This was carried out in [5] for the surfaces 
of boxes parallel to the axes. In this paper, we focus on the two-dimesional case. We develop a 
construction of surface entropy where rectangles are replaced by general curves. More precisely, 
following a suggestion by Hans Follmer, we introduce the specific entropy along sets generated by 
lattice approximations to lines, and then extend this to polygons and piecewise smooth curves. 

Most of the work here goes into our first result, a Shannon-McMillan theorem for the specific 
entropy h\{P) of a stationary random field P along a line (Theorem [33] if the slope is rational and 
Theorem 13. 61 if the slope is irrational). We prove the L X (P) -convergence of the rescaled information 
functions along increasing segments of the line's lattice approximation {(z, [Xz + a]) | z G Z}, where 
[x] denotes the integer part of x. If P fulfills a 0-1 law on the tail field, we obtain the formula 



Jo 

where is the a-algebra generated by those approximating sites which precede in the lexico- 
graphical ordering of Z 2 . If A is rational, the it suffices for P to be ergodic. Furthermore, the specific 
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entropy along the line can be written as 

1 S=i 

where p/q is the unique representation of A by integers p€ N and q G Z having no common divisor. 
The past er-algebras V P /q, vp / q correspond to the q different possibilities to start the g-periodic pattern 
of the lattice approximation. 

The idea to investigate such an specific entropy along lines has two precursors. The first is a 
volume-order directional entropy, which Milnor ( |16j and |17j ) introduced in the context of cellular 
automata. The second is the specific entropy along hyperplanes perpendicular to an axis, which was 
defined by [5] as a step toward their surface entropy along boxes. It may be noted that a distinction 
between rational and irrational slopes was also made by Sinai in his work [25] on Milnor's directional 
entropy for cellular automata. This construction was further developed by Park ([15], [20] . and [2"T] ) 
and Sinai [2S] • The original problem of continuity with respect to the direction was finally solved in 
Park [22]. 

Our next result is a Shannon-McMillan theorem along polygons (Theorem 14. 2p . In particular, 
we obtain a representation of the specific entropy of P along a polygon as a mixture of entropies 
along lines corresponding to its edges. The extension to polygons requires a strong form of the 0-1 
law on the tail field, which was introduced in [9]. It says that, for any subset J of Z 2 , the c-algebra 
generated by the sites in J does not increase if we add information about the tail behaviour outside 
of J; cf. Definition 12.21 Finally, we come to the main result in this section. Theorem 14.31 says that 
the specific entropy along a curve c : [0,T] — > R 2 is a mixture of entropies along the tangent lines: 

h c (P) = [ h d(t) {P)dt (1) 
Jo 

Here, h c >^(P) denotes the specific entropy along a line having the same slope as the tangent of c 
in t; cf. ([52")) for the exact definition. 

About the proofs of the Shannon-McMillan theorems. We will proceed in several steps, 
first for lines, for polygons and finally for curves. 

(i) Lines: The lattice approximation of a line with slope A G [0,1] and y-intercept a on an 
interval [0, n] is defined by L\ a (I) = {(z, [Xz + a]) \ z = 1, n}. We want to prove the convergence 
of the rescaled information content — (n + 1) _1 \ogP[uj Lx a no,n])] along sucessively larger segments 
of the line. To make this problem accessible to ergodic theory, we have to find a transformation 
which captures the stair climbing pattern along the lattice approximation of the line. If the slope 
is rational, the steps become periodic, and we proceed by combining a finite number of different 
transformations. In the case of an irrational slope, no such simplification is possible. We need to 
keep track not only of the integer part but also of the fractional part {Xz + a} in each step. We can 
realize this by introducing the skew product transformation 

S\: TxS! — >Txfl 

(*.^) 1 — > (T\(t),d(i, [t+A])w), 

where T is the one-dimensional torus, equipped with the Borel cr-algebra and the Haar measure, 
and T\ is the translation by A. Using appropriate ergodic theorems on the product space with the 
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skew product we obtain a Shannon-McMillan theorem along the lattice approximation of the line. 
In view of the extension of this result to polygons, we further need a variation of this result. Instead 
of lattice approximations increasing parts of a line, we use lattice approximations of blow-ups of a 
line segment. 

(it) Polygons: Consider a polygon tt, parametrized on [0,T], and its blowups B n n(t) = nnCt/nA, 
parametrized on [0, nT]. We study the sequence — logPjVt*] (n £ N) of rescaled informations of P 
restricted to its lattice approximations Conditioning site by site, the problem can be reduced 
to the Shannon-McMillan theorem along the edges, which is essentially covered by the Shannon- 
McMillan theorem along lines that we already established. One difficulty remains, which is getting 
around the corners. It can be overcome by the technique which [9] used in the case of boxes. We 
need the additional assumption of a strong form of a 0-1 law (Definition 12. 2[) . Under this condition, 
the entropy along a polygon is represented as a mixture of the entropies of its edges (Theorem [ 



(Hi) Curves: Our last step is the entropy along a piecewise smooth curve. After having established 
the results for lines and polygons, this part has become easy. We obtain ([I]) by approximating the 
curve with polygons. 

Relative entropy and large deviations. Shannon-McMillan theorems for the specific relative 
entropy h(Q,P) of two probability measures Q and P on the sequence space Y* 1 ' 2 '"'} are based 
on the functions — log(dQ/dP) [lo{\ n y] (n £ N) describing the relative information gained from 
the first n trials of an experiment. They are a key tool in the search for estimates in the theory 
of large deviations. By a large deviation we mean a rare event, or an untypical behavior occuring 
in a random sequence. Consider the empirical distributions := ^ X}j=i ^ ( n S N) of a 

stationary random sequence Wj (i € N). If the measure P is ergodic then \x n converges to the marginal 
distribution /i of P, P-almost surely and in L 1 (P). Large deviations are events like [fj, n £ A], A being 
a set in the space of probability measures on T whose closure does not contain \i. 

The aim of large deviation theory is to find lower and upper bounds which describe the asymptotic 
decay of the probabilities of such large deviations. In the classical case of a sequence of independent 
and identically distributed random variables, the decay of large deviations of the empirical distribu- 
tion is described by Sanov's theorem. Cramer's theorem addresses similar questions for the empirical 
averages. As a third level for investigating large deviations, Donsker and Varadhan [5] initiated the 
investigation of large deviations of empirical processes. 

We replace the random sequence by a random field, and the empirical processes by the empirical 
fields Rn(ui) := |V^| _1 Yliev ( n G N), where fli (i £ Z d ) denotes the group the shift transforma- 
tions. Comets [2], Follmer and Orey [H], and Olla [THj found the following large deviation principle 
for the empirical fields of a stationary Gibbs measure: For any open subset A of the space M.x(£l), 
of probability measures on fl — T z , 

liminf _L \ gP(R n £ A) > - inf h(Q,P), (2) 
n^oo \v n \ QeAnMi(n) 

and for any closed set C £ A4x(Q), 

hmsup log P(R n eC)<- inf h(Q, P), (3) 
rwoo \V n \ QecnMi(n) 

where the rate function is based on the specific relative entropy h(Q, P) := liniyj^oo | V n | Hv n (Q,P). 
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Large deviations in the phase transition regime. In the case of phase transition, there 
exists more than one Gibbs measure with respect to the same potential. Due to the variational 
principle for Gibbs measures (cf. Lanford and Ruelle [15] and, in purely information theoretical 
terms, Follmer [6]), the specific relative entropy of P to another stationary Gibbs measure Q with 
the same interaction potential vanishes. Thus, the relative entropy h(Q, P) appearing in and © 
may be zero even though Q is not contained in the closure of A. This suggests we need a refined 
analysis of large deviations in terms of surface-order rather than volume-order entropies. Assume 
in fact that the interaction satisfies the local Markov property. Then Hy{Q,P) — Hgy(Q, P) for 
any finite subset V of Z 2 , where dV is the boundary of V, i.e., the set of all sites outside of V which 
have distance 1 to V (cf. the end of Section 2 in [9]). Consequently, this relative entropy is in fact 
a surface-order term, and so it should be rescaled not by the size of the volume \V\ but by the size 
of its surface \dV\. This observation was the main motivation for introducing the concept of surface 
entropy, and for proving the corresponding Shannon-McMillan theorem. 

In the context of the two-dimensional Ising model, Schonmann [24] showed the existence of 
surface-order upper and lower bounds for the large deviations of the emperical means. For attractive 
models with a totally ordered state space, Follmer and Ort [§] found a lower bound for the large 
deviations of the empirical field in terms of the relative surface entropies along boxes (recalled as 
Theorem l6.1l in this paper). In their detailed analysis of the two-dimensional Ising model, Dobrushin, 
Kotecky, and Shlosman 4J justified on the basis of local interactions, that the phase-separating curve 
has the form of a Wulff shape. They proved a large deviation principle with a rate function in terms 
of the surface tension along the Wulff shape. Using different methods, Ioffe ([TT] and [12]) was able 
to extend their result up to the critical temperature. The appearance of such shapes suggests to 
revisit the approach of [9] but using the generalized surface entropies introduced in Section U] instead 
of the surface entropies based on boxes. 

This extension is carried out in the last section of this work, for the case of Gibbs measures with 
attractive interactions on a two-dimensional lattice. We study the large deviations for its empirical 
measure. The main result of this part of the paper is Thcorcm l6.2l which provides a lower bound in 
terms of the specific relative entropies along curves. One of the ingredients in the proof is the well 
known strategy of switching to a measure under which the large deviation becomes normal behavior, 
and then applying a Shannon-McMillan theorem. Making use of the global Markov property, we 
pass from densities restricted to the lattice points inside of a polygon to densities on the lattice 
approximations of its boundary. In this context, we prove an appropriate relative version of the 
Shannon-McMillan theorem, in analogy to the results in Section 2J Other major ingredients in 
the proof are geometrical observations concerning the interplay of the random field and the lattice 
approximations of curves. In particular, in Lemma 16.51 we compute the asymptotic ratio of the 
length of a line segment and its lattice approximation. These quantities merge into a factor in the 
lower bound in Theorem 16.21 involving the derivative of the curve. We will further touch on the case 
when the Markov property is only satisfied with respect a boundary that is a contour in the sense 
of statistical mechanics. 

Outline of the paper. The first section reviews some basic notions and properties of discrete 
random fields, information and entropy. In the second section, we introduce a specific entropy of a 
random field along a line. In Section 0] we construct the specific entropy of a random field along a 
curve proceeding in three steps: line segments, polygons, curves. Section [5] recalls some basic notions 
about Gibbs measures and phase transitions. In Section[H]we prove the refined large deviations lower 
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bound. The case of contour boundaries is briefly discussed in Subsection 14.41 and in Section [5] 

2 Random fields 

Consider ft :— T zd . where T is a finite set. For any subset V of Z d define Qy := T v . Let uiy be the 
projection of lo to V, Py the distribution of toy with respect to P, and Ty := <j(u>v) the cr-algebra 
generated by this projection. A probability measure P on (Q, T) is called a two-dimensional discrete 
random field. The transformations (O v )vez d defined by v oj(u) — ui(u + v) (u € Z d ) form the group of 
transformations on f2, called shift transformations. We assume P is stationary, that is, invariant with 
respect to the shift transformations. There arc different levels of Markov properties for random fields: 
when the subset of the lattice which generates the condition has to be finite, and when it can be 
any type of subset of the lattice. They both involve the boundary dV := { j € Z d \ V | disty (j) = 1 } 
of a subset V of the lattice Z d . 

Definition 2.1. A random field P has the local Markov property if, for any finite V C 7L d and for 

any nonnegative Ty -measurable 4>, E[(j) \ J-%d\ v ] — E[(f> \ J-gy]. A random field P which fulfills the 
local Markov property is called a Markov field. If the local Markov property holds for all any V C Z d , 
then P has the global Markov property. 

In Section [5l we will introduce the class of Gibbs measures in terms of interaction potentials. Any 
Gibbs measure belonging to a nearest-neighbor potential is a Markov field. Examples of random 
fields which have the local Markov property but not the global Markov property were given by 
Weizsacker [55] and Israel [TH]. P is called tail-trivial if it fulfills a 0-1 law on the tail field 

T ■= P| Fia\v = P) Fz*\y n , (4) 

VCTL d finite nGN 

where V n := ([— n, n] P\Z) d . Due to the spatial structure of a random field, tail-triviality is equivalent 
(cf. Proposition 7.9 from [10) ) to a mixing condition called short-range correlations: 

n — >oo 

sup | P{A n B) - P{A)P(B) | ► for all Be J. 

AEF ' z d\v n 

The following strong version of a 0-1 law was introduced by Follmer and Ort [5]. For J = it 
reduces to the classical 0-1 law on T . Remark 3.2.3 from [9] shows that it implies the global Markov 
property provided P has the local Markov property. 

Definition 2.2. P satisfies the strong 0-1 law if for any subset J of Z d the a-algebra Tj coincides 
modulo P with the a-algebra 

J 7 } := P| Fju(z d \v)- 

V<Zl d finite 

Let V and W be subsets of Z d . The information in ui restricted to V, with respect to P, is given 
by the random variable 2(Py)(u>) := — log P[coy], and the information conditioned on Tw is defined 
as T{Py\ ■ \F-w\{uj)) := — log P{uy\ ujw]- The entropy of P restricted to V is defined as 



Hy(P):=E[l(Py)(w)}=- J2 P[^v}\0gP[uJy}, 
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and the conditional entropy of P to Tw is Hv(P[- \^w]) : = P[~ log P[ky| cow]] = H(Py[- \J r w])- 
The specific entropy h(P) of P is defined as the limit of | V n \~ 1 H(Py n ) for n to infinity. Its existence 
can be proved by a subadditivity property (cf., for instance, Theorem 15.12 in [ID]), but it also follows 
from a Shannon-McMillan theorem by Follmer [6] and Thouvenot [27] . They showed that the specific 
entropy for ergodic P is E\H(Pq[- \V](u>))], where Po[- \V}{oj) is the conditional distribution of the 
random field in the origin with respect to the tr-algebra V generated by all sites which are smaller 
than the origin with respect to the lexicographical ordering on Z d . Moreover, they showed that for 
all stationary P 

j^I(iVj E[H(Pol\V](u)) \J] in L X (P), (5) 

where J is the a-algebra of all sets invariant with respect to the transformations 9 V (v £ Z d ). 

A lemma from [5] will be used in some of the proofs in this paper. We recall it here for the 
reader's convenience. 

Lemma 2.3. Consider a-algebras B{ C B* (i £ N) increasing to Boo, respectively decreasing to B^, 
and assume that Boo = B^ mod P. Then for any <fi £ L (P), 

lim sup -E[ij>\ Boo] II rv P , =0. (6) 

3 A Shannon-McMillan theorem along lines 

From now on we will consider the two dimensional case. The line with slope A and y-intercept a is 
described by the function 

h,a{ x ) : = Xx + a (x£R). (7) 

Using [x] and {x} for the integer and the fractional part of x, respectively, the two-sided sequences 
([l\,a{z)]) z£Z and ({l\, a {z)}) z£Z are the line's integer and fractional parts at the integer points 
z £ 7L. In the case when < A < 1, the lattice approximation of l\^ a on U C Z and on an interval 
ICR are given by 

L Xta {U):={(z,[l x , a (z)])\z£U} and L A)0 (7) := L x>a (I n Z). (8) 

In the case when —1 < A < 0, we use the lattice approximation L\_ a (I) := —L-\ ia {I). If |A| > 1, we 
represent the line as a function of the y-axis with the new slope 1/A (or in the case of a parallel 
to the y-axis) and proceed as before. 

We want to show the L 1 (P)-convergence of the sequence of rescaled information along successively 
increased parts of the lattice approximation to the line 

\L x ^ a {%n])\- l l{P LxA[0M) ) (nGN). 

In order to make our problem accessible to ergodic theory, we need to create a transformation 
that follows the stair climbing pattern along the lattice approximation of the line. Lemma 13.21 will 
introduce a transformation that follows the desired path. To get there we need a mechanism to 
keep track not only of the integer part, but also of the fractional part. Let r x (t) := {t + A} (t £ T) 
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be the translation by A on the torus T := [0, 1] with its ends identified and equipped with the 
Borel cr-algebra B and the Lebesgue measure /z. Consider the product space T x fi, equipped with 
the product cr-algebra T, and the product measure P = /i ® P. We will now show a few technical 
lemmata that are needed to proove the main results in this section. 

Lemma 3.1. For A G R, z, z G Z, a G T and I C Z we /iave t/ie following: 

(i) T*(a) = {l x , a (z)}. 

(ii) The junction /ias a unique zero. More explicitly: If z and so A are both positive or both 
negative, the zero is at a = 1 — {zA}. // one is negative and the other is positive, the zero is at 
a = — {z\}. If one of them is zero, then the unique zero is at a = 0. 

(Hi) l\, a {z + z) =l\ }a {z) +Xz and lx,a+z{z) = h,a(z) + 

(iv) [l\, a (z + Z)] = [h, a (z)] + [l X , r * (a) {*)] and [l\.a+z{z)] = [h,a( z )} + 
(v) Lx.ai 1 + z ) = L X,r-(a){I) + Lx.Jz). 

(vi) L x , a+z (I) = Lx, a {I) + (0,z). 

Proof, (i) By induction over z. (ii) The case z = and the case A = are trivial. Let z G Z\{0}. By 
(i), r? has a zero at a if and only if {a + zX} = 0. The latter is equivalent to a + {zX} G Z (*). If z and 
A are both positive then < a + {zX} < 2. Therefore, condition (*) is equivalent to a = 1 — {zX}. 
If z and A are both negative, {zX} is again positive and the argument works as well. If one is 
negative and the other positive, then — 1 < a + {zX} < 1, and (*) is equivalent to a + {zX} = 0. (iii) 
l\,a(z + z) = X(z + z)+a = lx, a {z) + Xz and lx,a+z{z) = Xz + a + z = lx, a (z) + z. (iv) [l\, a (z + z)] = 
[[l\, a (z)] + {lx, a (z)} + Xz] = [iA,a(*)] + [^(o) + A2] = [h, a (z)] + [Z A ,r*(a) («)], using (iii), (i) and 0. 
The second statement is an immediate consequence of (iii). (v) Using (J8]) and (iv), Lx. a (I + z) = 

{(*,pA,a(2)])|« € I + z} = {{Z + Zl [lx.a{z + Z)])\z el} = {(I,[lx,r>(a)(?)}) + (Z,{h,a(z)])\z £ 1} = 
Lx.rl(a){I) +L X ,a{z). (vi) L X ,a+z{I) = { {z, [lx,a+z (2)) | Z G /} = { {z, [lx,a+z(z)) + (0, z) | Z G /} = 

L\, a (I) + (0, z), using © and (iv). □ 

Lemma 3.2. The iterates of the transformation Sx{a,Lu) := (ta(gs), QtiAa+x])^) (ft £ T,w e fi) are 
firoen 6j/ ^(o,^) = (t£(o), Lx i0 ( n )^) /or a// n £ No. 

Proof. With «(o) := (1, [a + A]), we get S\(a,w) = (r A (a), 6> K(a) w) and S r x l (a,uj) = (r£(a), 6> Kn ( a )w) 
where /t n ; = X)"=o 1 K r A (n G N). It remains to show that /t n (a) = (n, [^^(n)]) for all a G T. For 
the first component this is obvious. For the second component it follows by induction: Trivial for 
n = 0. Then, because of Lemma EHiv), «4+i( a ) = K « \ a ) + k (2) ( t a ( a )) = pA, a (n)] + [r£(a) + A] = 

[iA,a(n)] + [/A,Tj(a)] = [M« + 1)]. □ 

We use the short forms 7\a : = 3~l x a ((-oo,-i]) and Px \ : ~ J~l x a ((_i,-i]) (« G N) for the various 
pasi c-algebren indexed by the lattice approximation along of line with slope A and intercept a. 
They play a central role in the representation of the limits in the following two Shannon-McMillan 
type theorems. We further define the functions 

F(a,u):=X(P [-\Vx,a])(u), Ffaw) :=l(P [- |P^J)(w) (* G N) with a G 1> G a (9) 

Lemma 3.3. For all a G T, u> G fi, and n G N, T(P^ X a ([o,n]))( w ) = S™=o ^ ° ^K 12 ' w )- 
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Proof. By conditioning and shifting 

n n 

p kLA,a([o,«])] = l[ p [ u} L Xia (i)\uL X:a aa,i--i])] = Il- P [' i; (o,o)|wi A , a ([o,i-i])-i A , <l (i)] °6L x , a {i)- 

i=0 i=0 

By Lemma EDV), L x , a {[0,i- 1]) - Lx, a (i) = L A)T « (o) ([-i, -1]), so 

n 

X(P iA , a ([0,...,n])H=^X(Po[-|^ (a) ])(^ A , (l (^), 

i=0 

and Lemma l3~2l concludes the proof. □ 

We will also need the following result about the asymptotic behavior of the functions Fi as i 
goes to infinity and about their properties as functions of the first parameter. Recall that Pq is the 
marginal distribution of P restricted to the origin. 

Lemma 3.4. Assume that 

VA6% (0l o) } : P [-|A]>0. (10) 

Then, (i^)j e ^ converge to F, P -almost surely, in L (P) in a; G fi. For any fixed uj £ fi, the functions 
Fi(-,uj) (i G N) are piecewise constant on T. If X is rational then F(-,u) is piecewise constant as 
well, and the convergence is uniform in t. If X is irrational, if P fulfills the strong 0-1 law and if 

3c>0 VAe%(o,o) } : P [-\A]>c, (11) 
then F(- ,u>) is Riemann-integrable on T, and the convergence is uniform in t. 

More precisely, the set of points where the function Fi(-,u>) may be discontinuous is given by 
{t\,v\ v = — 1; — *}i where t\ u is the unique zero of t\ v on T (cf. Lemma WlV ii) ) . For rational 
A, the set of potential points of discontinuities of Pj(-,w) and P(-,cj) is { = — 1, «)}, 
where p/q is the unique representation of X with integers p G Z and q G N having no common divisor. 

Proof of the Lemma. Fix any t G T. Since the er-algebras (^l a t ([-i,-i])) igN form an increasing 
family, P[w(q.o) I wij t([-i, -■*])] ) ^ s a martingale, so that we obtain by the convergence theorem 
for martingales that 

I 1 *~ > °° r I -| 

P i w (0,0) I w i A , t ([-l,-i])J > P L W (0,0) I Wi A , t {[-l ,-oo))J 

P-almost surely and in L 1 (P). By (jTUJ) this remains true when we take logarithms on both sides, 
which proves the first assertion of the lemma. 

Now fix uj <E fi and t,t € T, and find a (sufficient) condition under which Fi(t, ui) — Fi(t, uj). The 
only influence that the variable t actually has on Fi is its effect on the set L x ,t([— 1, — *]) of sites we 
condition on. By {5}, Lx,t ([— 1) — *]) = 1% z([-l, — i]) if and only if 

pA,tM] = P A ,t(^] for all i/ G {-1, -»}. (12) 

By Lemma l3TlT i). [/a,*(^)] = — A^ — r A (t), so the equality in (fl2|) is equivalent to t — t — t% (t) — t% (t) . 
This is fulfilled if and only if t and t are both either smaller than tx,i> or both larger than t x ,v Applying 
this argument to all v G {— 1, i}, we see that the function Pj(-,w) is piecewise constant, and 
the set of possible jumps is given by Di = {tx,u\ v = — 1, — i}. 
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If A is rational, these sets actually become independent of i, for i large enough. With the 
unique representation A = p/q used above, and the periodicity of the sequence (iA,i>)i/£Ni we obtain 
Di — {t\^\ v = —l,.,.,—(q A i)}. In particular, the convergence is uniform and the limit P is 
piecewise constant in the first variable. 

Now assume that A is irrational. To show that P is Riemann-integrable it suffices to show that 
the set of continuity points has full measure. We will prove that F(-,oj) is continuous on T \ Doc, 
where Poc := {t\. u \ v — —1, —2, ...}. Fix ^ £ T \ and let be e > 0. We apply Lemma \2. 31 with 
B k =Lx,t([-k,-l]), andfi| = L x ,t{[-k,~l]){J(1 2 \V k ), where V k = [-k, k} 2 . This gives us a k eN 
such that for all k > ko and ieT with Lx,t {[— k, —1]) = Lx,t([—k,—l]) (**) we obtain 



P [ w (o,o)| ^i x ,t ([-fc,-i])] _ P [ W (0,0)| W iXii ([-L- 



i])J 



< £. 



By definition of to, (5 := mm{|io — iA,i/||^ — — 1,...,— fc} is larger than 0, and by Lemma I3.1f i). 
(**) is true for all t <E T for which \t — to\ < S. So, P^mml wl a ^ s continuous in t = to- 

Note that, for all i € N, Di C Pi+i, and that the maximal height of new jumps added in step i, 
that is, the steps in Pi+i \ Di, decreases with i. This implies the uniformity of the convergence. 
Finally, by taking logarithms and using (lll|) . the above statements are also true for the sequence 
(P(-,w)) 4eN . □ 

Theorem 3.5. Let A be rational and p/q its unique representation with p £ Z and q £ N having no 
common divisor. Assume that P fulfills condition (|10[) . TTien /or a/Z a £ K, 



■^(-PiA.odO.n])) 



1 9-1 r 

-J2e[h{p [ ■ \V p/9>up/q ]{u)) J q . P 



t/=0 



(13) 



P -almost surely and in L 1 (P), where J q . v is the a -algebra of all sets which are invariant with respect 
to 0( q ,p) ■ In particular, if P is ergodic with respect to 0( q , p ) then the limit equals 



-J2E[H(P [.\V. t 



P/q, vp/q. 



(")) 



Proof. By Lemma [3~3l the left-hand side of (fTHj) equals l/(n + l)X)"=o p ° ^K ^)- Because of 
Lemma f3.4l and Maker's version of the ergodic theorem (cf. Theorem 7.4 in Chapter 1 of [Hj), it 
suffices to show that 

n 

— J2FoSi(a,u) (14) 

i=0 

converges to the limit in (|13p . For each n £ N there are unique m £ N and 77 € {0, 1, q — 1} such 
that n = m<7 + 77. The last term can be rewritten as 

<j— 1 , m— 1 

m 



- V - V P o + — Vfo 



!/=0 j=0 v=Q 



The second addend converges to as n (and therefore also m) goes to infinity. The first factor 
converges to l/q, so it remains to study the asymptotic behavior of 



771—1 

A^F(a,u;) := - £ P c Sf+>, w ). 
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Use K n (n € N) defined as in the proof of Lemma \'S. 2 1 Since = Id we obtain Kj q+L , = jn q + k v , 
and S{ q+V = (t»,6 Kv o (6 Rq y). This yields 

m— 1 



A$F(a,w) = — j[] F(r"(o),fl*" w o (6» K ' (a) ) J w). 



For a fixed, applying Birkhoff's ergodic theorem to the function /<£ (uS) := -F(r^(a), re / o )0;) yields 
Km^A%>F(a,-) = E[f t (l/) \J {q;P) ] =e[h(P [ ■ \V P / q ,„ P / g ] {u))\j( q , P )] P-a.s. and in L^P). 

In the ergodic case J7( 9j p) is trivial. Using the invariance of P under 9, the last expression reduces 
to E[F(t u (o), •)]. □ 

For the case of an irrational slope A we can show the corresponding result provided P fulfills 
some additional conditions, in particular the strong 0-1 law (cf. Definition! 



Theorem 3.6. Let X be irrational. Assume that P fulfills condition (llljl and the strong 0-1 law. 
Then for all a £ M, 



— — I(P £ ([0in]) ) ►/ E[H(P [-\V x , t ](u))]dt mL\P). 

n t J. Jn 



(15) 



Proof. As in the proof of Theorem 13.51 the left-hand side of ([13"]) can be rewritten as an ergodic 
average involving the skew product S\. Using Lemma I3~4l and Maker's version of the ergodic theorem 
(cf. Theorem 7.4 in Chapter 1 of [H]), it suffices to show that for all i € N, 



n „i 

l/(n + l)£p fe o<?i(a,a,)^^ / P[P (P [. |pg]( W ))] eft in L^P) 



(16) 



For i 6 N fixed, Pi is piecewise constant in i, so there is an P G N, intervals Ui,...,Ur and 
measurable functions /i, /r on (0,.P) such that Pj(t, w) = 53 r= x l(7 r /r(k>)- By approximation, 
this can further be reduced to the case of indicator functions at the place of the / r 's. It remains to 
show the convergence for functions of the form lu(t) 1a(w) for an interval U C T and a set A e J. 

Let Fi(t,uj) = 1[/ (i) 1a(w). We will show the convergence in L 2 (P). Using 

U(^ (t) (^)U(^. (t) c)P(^) = P(^ {t) _ Kj(4) An A), 



we see that 



1 - r 1 
— J2F(r\t),9 Ki{t) - )-jf P[P], 

1 - 

— ^i ( ,(r ! (t))i (/ (^(t))p(r 1 (i) _ %(() AnA) + Mt/) 2 ^) 5 

n r /. 

lu{r\t)) / U(0 Ki(t)W )P(^) /i(C/)P(A) 



(n + 1) 



(n+ l) 5 



(17) 
(18) 



The next step is to show that the first addend in line (|17p may be replaced by 

1 " \ 2 

+ iyEMT 4 (*))J p(A) 2 



y ' i=0 



(19) 
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without affecting the asymptotic behavior of (fT7|) . We may bound 

n 

——- J2 lv{r'it))lu^{t)){p(e-\ t) _ Kj{t) A^A) -P{Af) 

1 ' i,j=0 

1 " I 

^T^TTF E\ p (o-J (t) - Ki(t) AnA)-P(Ar. 

Fix e > 0. Let || • |j denote the maximum norm on Z 2 . As a special case of the strong 0-1 law, P 
fulfills a 0-1 law on the tail field. By ([2|) there is an m £ N such that 

e 



p(e^AnA) - p{Af 



< 



for all k e Z with II fell > m. 



Note that \\Ki{t) - Kj(t)\\ >\i- j\ for all t € T. Si 
for all to € N, we can find an no £ N such that 

1 



lim„ 



{l < hj < n \\i ~ j\ < m }\ = 



7 j ~7y2 SU P K 1 ^ ^ n I IM*) ~ < TO }| < 

(n + l) 2 teT 



for all n> no. 



This demonstrates that hat the difference created by the by replacing the first addend in (TIT)) by 
(|19[) converges to uniformly with respect to t. 

Since {/ is an interval, the function Xjj is integrable in the sense of Riemann. Applying a variant 
of Weyl's theorem for Riemann integrable functions (cf., for instance, Theorem 2.6 in Chapter 1 of 
|14j ) implies that, for n to infinity, l/(n+ 1) X^^Lo lc / ( 7 "*(*)) converges to n(U) uniformly in t. So, 
asymptotically, the sum of the two addends in line (fTTj) equals 2 fi(U) 2 P(A) 2 . 

The expression in (fTHj) can be simplified to — 2/(n + 1) X)"=o l^l 1 " 1 WW^O-P^) 2 - Applying a 
variant of Weyl's theorem for Riemann integrable functions again, this expression converges to 
—2 /i(U) 2 P(A) 2 , which concludes the proof. □ 



4 A Shannon-McMillan theorem along general shapes 

Let P be a stationary random field that satisfies the strong 0-1 law and the condition (fTTj) . The goal 
of this section is a Shannon-McMillan theorem for a stochastic field along the lattice approximations 
of the blowups of a curve c, and an explicit formula for the limit h c {P), the specific entropy of 
P along c. This will be done in three steps: for linear segments, for polygons and for curves. In 
Subsection 14.41 we will further introduce lattice approximations that are contours in the sense of 
statistical mechanics and sketch corresponding results for them. 

Let c = [c^\ be a piecewise differentiable planar curve parametrized by t € [0, T]. Assume 
that the trace of c does not contain the origin and that it hits the y-axis in t = 0. Let c' denote the 
right derivative of c. The blowups of the curve c are given by 

B v c : [0, rjT) — ► K 2 , B n c(t) = r? c{t/rj) fa > 0). (20) 

It will follow from the construction in Subsection 14.21 that is enough to consider curves that are 
described by the graph of a function <f> on a segment of one of the axes. Suppose that <j> is a 
function on the interval [x,x\ of the x-axis. (The case of the j/-axis can be treated analogously.) 
More precisely, x = cW(0) and x = c^(T). The interval [x,x\ contains a finite number u of integers 
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z, z + u, where u = [x — x] oru= [x — x\ — 1. In the same way, the blowups B n c can be represented 
as graphs of functions cf> n of intervals [x n ,x n ]. We obtain by ([2"Uf . x n = B n c^(0) = nx = nc^(0) 
and x n = B n c^ (nT) = nx — nc^(T). Again, the interval [aj„,£ n ] contains a finite number u n of 
integers z n , z n + 1, z„ + u n , where 

u n = [n(x — x)] or u n = [n(x — x)] — 1. (21) 

In particular, the sequence (u n ) ne n goes to infinity. 



4.1 Line segments 

The main result of this section is the convergence of the sequence of renormalized information 
functions 

TAk^-w)^ (22) 

along the lattice approximation converges to the entropy h\(P) of P along a line with slope A. If c 
is a line segment, the functions 4> n are of the form 4> n (x) — \x + a n (x € [nc^ 1 ^(0),nc ( ^ 1 - ) (T)]) where 
A = (c( 2 )(T) - c( 2 )(0))/(c«(T) - ^^(O)) and a„ = 7i( C ( 2 >(0) - Xc^(0)). Assume < A < 1. As 
explained at the beginning of Section [3l the other cases can be reduced to this case. At first sight 
it seems we could just apply the results for the specific entropy along a line from the last section. 
But the blowups of the line segments move in space, which has the following consequences: 

(i) There is a sequence (a„)„ e N instead of a constant a. 

(ii) The sequence is real-valued, as opposed to the constant having values in T. 

(iii) The positions of the lattice points in each step are more difficult to describe. In the case of the 
line we simply looked at approximating points with x-values between and n. Now, x-values 
of the lattice points lie in the interval between z n and z n + u n . 

The last problem forces us to apply, at each step n, an additional shift to uj which brings the line 
segment close to the origin. These shifts do not affect the L 1 (P)-convergence since the limit is shift 
invariant. The number of points in the nth step is given by (w n )rieN7 instead of simply n + 1 as in 
the last section, but this is irrelevant as long as the sequence goes to infinity. The second problem 
requires another shift in each step n. The first point is the most delicate. It is here that we need the 
convergence of the ergodic averages in all t, rather than just almost all t. 

Theorem 4.1. In and uniformly in a £ K, 

Proof. We start with some technical preparations. Set a := a\. Using the notation for [x,x] 
described above (|2T|) . define for neN, L n {a) := L\^ an {z n , . . . ,z n + u n ). The total number of sites in 
L n (t) is u n + 1 . To transform (f2"2")l into some sort of ergodic average we first condition on successively 
smaller parts of L n (a). A new step begins at i if and only if i"^" +i_1 ({a}) > 1 — A. For all z,i G Z, 
and a £ K, 

Lx, a (z + Lx. a (z) = Lx, T ~({a}){-i)- (23) 
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To prove this, first apply the second equation in Lemma l3.1f v) with a = {a} and z = [a], and then 
apply the first equation with z = i, 

LxA z + i) = (z + i, [l\,{a}{z + *)]) + (°. H) 

= (z + i, [/>,{«}(*)] + pA,T*({a})(*)]) + (°> H) 

= (Z, [h,a( Z )]) + (h lh,T*({a})(i)}) = L\,a{z) + L X>T * ({a}) (i). 

We calculate the information in ([221) by conditioning site by site along L n (a). We use ui{i) instead 
of cjj for easier reading. Shifting lu to the origin, applying (|23p and using the functions defined in 
© yields 



2"(-Pl„(q))(w) = -J^logP u}(Lx, a „(z n + i)) u>(Lx,a n (z n + i - 1 

i=0 

= -^k>g.p|u;(0,0) | U { L >;a n { z n + 4-1, —,Zn) - L X ,a n { z n + «)) 
i=0 

= -£lo g pU),0) w(L ,„ +i({0n}) (-l,...,-i)) 



! • * * ; ; 



0< ^A,a„ (*»+<) 



W°^A.. n (*n) ( 24 ) 



= -E F 4^(r A z "(K}))^L A , Tj „ ({an})W °^ (z „)^). 

i=0 A 

Applying Lemma 13.21 putting all together back in (124p and renormalizing yields 
-^-X(P L „ (Q) )H = -^f^F i0 Si{T*"({a n }),6 LX:M u; 

U"n. \ J- Lin | -L 



(25) 



i=0 



To prove the convergence, we have to distinguish the case when A is rational from the cases when it 
is irrational, because this determines whether tx is periodic or uniquely ergodic. We proceed as in 
the proof of Theorem 13.51 and Theorem 13.61 respectively. □ 



4.2 Polygons 

The next step is to define the entropy along a polygon, that is a piecewise linear curve tt : [0, T] —> R 2 . 
Assume further that tt fulfills the other assumptions on c stated at the beginning of this section. Let 
R be the number of edges of tt. We can find slopes A^ 6 (—1, 1], constants t^ r ' € R, and intervals 
l( r > of the x- or the j/-axis such that 

R 

7r([0,T]) = U^), tM (/ (r) ), (26) 

r=l 

with lx,t as defined in (O as a function of the x- or of the y-axis. Proceeding the same way for the 
blowups B n 7T (n £ N) as defined in (|2T)|) , we choose tffl G R and 1^ C R, such that 

S„7r([0,T]) = U z AM,e>(4 r) ), 

r=l 

The lattice approximations of the edges combine to a lattice approximation of B„tt : 

R 

r=l 
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Theorem 4.2. The lattice approximations converge. More precisely, 

1 R 

— — ^length 7r< r >/i A cr)(P) m£ x (P). 



length L£ I ^ Pl ^ > ^ length tt 



(28) 



r=l 



In some contexts it is more convenient to express the limit as an integral with respect to t rather 
than as a sum. Let tt' {t) denote the right derivative of tt. Then the limit can be written as 



1 * / K, {t) {P)dt. 

length tt J 



(29) 



Proof of the theorem. Use for u>i and define the sets E n r ' :— L x(r) t ( r) {In 1 ) (r € {1, . . . , R}). 
By conditioning, 

J(P £S )M = £ logP[u,(pM) I c(P<T ^ . . . , PW)] . (30) 

r=l 

Fix re {1, . . . , P}. Omit the index r when there is no risk of confusion (for example A := X^'\ t n := 
tn ,E n :— E n r ') and use the short form E n := E n r ~^ U • • ■ U En for the lattice approximations of 

(r) 

the edges of the polygon which come prior to E n ' . We will condition successively on the elements of 



E n r . Denoting the integers in 7„ by z n , z n + 1, . . . , z n + U n as in ()21j) .we obtain for the rth addend 
in P]l 



logP w(P„) w(P„)J = J^logP Lj(Lx, tn (z n +«)) u(L x ,t n {z n +i - 1 
i=0 

Shifting by w m (<) := Lxj n (z n + i) yields 

^logP w(0,0) w(L A , t „(z„ +i - 1, .. . ,z„) - v ni (t),E n - v ni (t)) 



By 



i=0 

this equals 



£> g pL(0,0) 



i=0 



W ( L A : r^ + l ({t„})( _1 ' • • • > ^)-- E « - U ni (t)) 



(31) 



This expression is similar to ([2"4")l except for the additional conditionings on the sites E n — v n i{t). 
We will show that these conditions disappear asymptotically. The argument will be given in detail 
for the first summand; it is similar for the remaining ones. Let a be the minimum angle between 
any neighboring edges of the polygon tt and let d n be the minimum distance between an edge of 
the nth blowup B n TT of the polygon and any of its nonneighboring edges. Also, let H n be the 

(r) • (r) 

hexagon defined as follows: H n is symmetric around E„ , two sides are parallel to En at a distance 

(r) 

d n /2. The other sides reach from the endpoint of the first two to the endpoints of E n , and they 
intersect at an angle a. Observe that E n C Z 2 \ H n , and therefore E n — v n i(t) C Z 2 \ (H n — v n i(t)) . 
Define the cr-algebras B t (t) := T(L X iT * ({t}) (-1, . . . , -i)) , := J r (L Xr ^ {t}) (-l, -2, . . .)) and 

B*(t) := J"(L AT i ({t}) (-l, ...,-i)UZ 2 \ (H n - « n » (*)))• The sequence (B l (t)) i( _ N is increasing to 
Boo(t), and the sequence (B*(t)) ieN is decreasing to fi^(i) := f] ieN B*(t). By the strong 0-1 law, 
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B^(f) = B^it) mod P. By Lemma 



lim 

i — >oo 



logP 



w(0, 0) w(i A)T « (t) (-1, . . . , -i), £ n - u n <(t)) 
-lo g pL(0,0) ^(L A)T * ({t}) (-l,.. 



■i)) (a,) 



= 0. 



Proceeding with (|3"Tj) as with (JM]) and using that, for all r £ {1,...,R}, length E n r '/ length 
asymptotically equals length -k^ /length tt concludes the proof. □ 



4.3 Curves 



To use the results from the previous section, wc need to relate the derivatives of the curve with a 



slopes of lines. Let v £ S 1 = {w £ 



; | = 1}, and a the angle from the positive a;- axis to the 



vector v. If \a\ < ir/4 or \a\ > 3ir/4 then describe the line in the direction of v by a function of the 
x-axis; otherwise describe it as a function of the y-axis. We assign any v E S 1 & specific entropy 



h v (P) := hx( v ) (P), where X(v) :— minQtg a\, \cta\). 



(32) 



Theorem 4.3. Let c : [0, T] — > R 2 be a piecewise continuously differentiable curve. Assume the 
trace does not contain the origin. Let ir n : [0, nT] — ► M 2 (n £ N) be a sequence of polygons such 
that 

1 



Then, in L 1 (P), 



length tt„ te[0 ,„T] 
■AP Ll ) 



Sup Kfl-Vn/W-c'Wl 



0. 



(33) 



1 



1 



length n r . 



length c J 



h c , {t) {P)dt. 



Proof. We have to show that 



lim 

n — *oo 



1 



-1{P L 



1 



h c , {t) {P)dt 



0. 



(34) 



length 7r„ " length c J 

Without loss of generality we can assume that c has no self-intersections and that c is parametrized 
by arc length. As can be seen by the construction of the entropy for polygons, 

1 



■X{P L% ) - h Wn (P) 



L>) 



length n ri 

converges to 0. By the representation formula in Remark [29] and since Tr' n (t) = ((i3„7r n ) _1 )'(£/n), 
for all t £ [0, nT], we obtain 

1 f nT n f T 

^ (P) = ie^ith^/ h <^ P)dr= \^T n j h ((B^)-mt)(P)dt, 

and by (|33j) and Lemma l3.4| the integral converges to h c >^ dt. Use || ■ || for the euclidian norm 
in the plane. Using 



— length 7T r , 
n 



T \\n' n (nt)\\dt= f \\{B- l )'{t)\\dt, 
o Jo 



we obtain by (133]) 



lim — length 7r„ 

n^oo n 



\c'(t)\\ dt = length c. 
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This implies that 



h*n(P)~\ 77- / h o{t){P)dt 

length c J V(p) 

converges to as well, and (f3"4"| follows by the triangle inequality. □ 

Note that the limits do not depend on the sequence of polygons we used to approximate the curve 
and that any approximation of the curve by lattice points can be described by a lattice approximation 
of a suitable polygon. This justifies the 

Definition 4.4. Let P and be as in TheoremU.tA Then 



length c Jo 

is called specific entropy of P along c. 

Note that the condition that the trace of c does not contain the origin is no real restriction for 
the definition as the expression on the left-hand side only depends on the derivative of c. This is 
intuitive, because we assumed that P is stationary. Note the following property for the entropies of 
the blowups of a curve defined in (|20|) . The proof is a simple scaling argument. 

Corollary 4.5. Let c : [0, T] — > R 2 be a piecewise differentiate, and let B v c : [0, r]T] — ► R 2 with 
B v c(t) — T)c(-) (rj > 0) be the family of its blowups. Then h,B n c{P) = hc{P) for all r\ > 0. 



4.4 Contour approximation 

In statistical mechanics, a contour is a set of sites corresponding unambiguously to a chain of bonds. 
Note that the lattice approximation is not a contour in this sense. The last site before a new step is 
catercornered from the first site of the step, so not connected by a bond, and closing the gap is not 
uniquely defined. We define the contour approximation by adding, at each new step, the site which 
is one unit below it. A new step begins in i + 1 if and only if r{({a}) > 1 — A, and the site we add 
in this case is L\ <a {I) — (0, 1). For z £ Z and aeN, 

L x . a (z, ...,z + u):= L x , a (z, ...,z + u)U {L Ka {i) - (0, 1) | < * < u - 1 A t z x +1 {{o\) > 1 - A} 

be the contour approximation of the line segment l\ t a{P)- Lemma 13.11 and Lemma 13.41 translate 
immediately to L. With little modifications we can prove contour versions of the limit theorems 
derived earlier in this section. We will sketch the results here and refer to [T] for details and proofs. 

For n £ N, define L n (a) :— L\,a n ( z n, ■ ■ ■ > z n + u n ), and for a G R and z, z G Z with z > z define 
L{jz,...,z):= (35) 

L x>a {z- 1, . . . , z) U (La,„(2) - (0, 1)) if rf- x ({a}) > 1 — A, 

L\ t a(z — 1, . . . , z) otherwise. 

The specific contour entropy along a line h\(P) with slope A is defined as 

"1 /-i \ 



1 



1 + A 



/ E[H(P [- 1 ^(4 it (-N))]( W ))] dt + / E[H(P [- 1 ^(£ A ,t(-N) U {(0, l)})](w))] dt 
lo Jl-X J 
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We can show a contour version of Theorem 14.11 

[ n — >oo ^ i 

— l{P Zn[a) ) > h x (P) in l/(P) and uniformly in a G M. 

The formulations of contour versions of Theoreml4.2l and Theorem 14.31 are now obvious. 



5 Gibbs measures and specific entropies 

A collection (Uv)vci d finite °f functions on f2 is called stationary summable interaction potential if 
the following three conditions are fulfilled: (i) Uy is measurable with respect to Ty for all V C T, d . 

For all i e N and all finite V C Z d , Uy +i — U v ° 0i- (hi) XVcz d finite: oev II ^ H°°< 00 • Let 
£, rj € f2 be two configurations. The conditional energy of £ on given the environment 77 on Z d \ V 
is defined as 

AcZ d finite: 

where (£,7?)y is the element of given by (£, T})v{i) '■= for i S V, and {£,rj)y(i) := for 
i € Z d \ V. P is called Gibbs measure with respect to U if for any finite subset V of Z d the conditional 
distribution of u>v under P with respect to T%a\y is given by 

P[iOy = £y I T zd \y]{r,) = -J—e^C^) where ^( ) ;= / e -EvKM p(^) 

is called partition function. We say that there is a phase transition if there is more than one Gibbs 
measure with respect to the same interaction potential. 

Assume that T is furnished with a total order <, and denote by — the minimal and by + the 
maximal element in T. Suppose that U is attractive with respect to the order on T, in the sense of 
(9.7) in [53]. Let P~ and P + denote the minimal and the maximal Gibbs measure with respect to 
U, and let P a = aP~ + (1 — a)P + (0 < a < 1) be their mixtures. Both P~ and P + are ergodic 
and, as follows from [7], they fulfill the strong 0-1 law and the global Markov property. Follmer and 
Ort [9] define the specific relative entropy based on hyperspaces by 

s(P-,P+) = ~J2 f H(p a -[-\^](c),P+[-\^](cj))p-(du;), (36) 
1=1 n 

where J 7 ^ is the er-algebra generated by those coordinates in {(i^ , iW) € 7L d \ = 0} which 
precede in the lexicographical order on Z d . 

In the two-dimensional case, the conditions in p6|) are simply along the coordinate axes. Based 
on the work in Section HJ we can now extend this definition to a surface-order entropy along any 
direction v £ S . Furthermore, we can introduce an entropy along curves. 

Definition 5.1. Let v € S . Let c : [0, T] 1 — ► M 2 be a piecewise differ entiable curve parametrized by 
arc length with right derivative c'. 



h v (P~,P^ 



J J H(Pa [ ■ \Vx M , t ] («), P + [ • \V x(v) , t ] (w)) P-(du)dt 
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is called specific relative entropy of P with respect to P + in direction v. 

h c (p~,p+) :=r-^- [ T h c , (t) (p-,p+)dt 

length cj 

is called specific relative entropy of P~ with respect to P + along c. 

The order on T induces an order on the set A4i(T) of probability measures on T : We say that 
/i is larger then v if the density is an increasing function with respect to the order on T, and in 
this case we write fi > v. In particular, v is absolutely continuous with respect to fi. The following 
inverse triangle inequality for relative entropies was shown in the proof of Theorem 4.2 in [9], For 
the reader's convenience we state it in the following form. 

Lemma 5.2. Let A > /i > v, and assume that [i is bounded below by a positive constant. Then 
H(v, A) > H(v,n)+H{n,\). 



6 Lower bound 

Let P~ and P + be the minimal and maximal Gibbs measure defined in Section \5\ The following 
lower bound for the large deviations of the empirical field R n (to) :— YlieV &0hj of P + was proved by 
Follmer and Ort [9]. Recall that V n is the set of all lattice sites in [— n, n] d , and that the boundary 
of a subset V of 7L d is defined as dV = {i G Z d \ V | dist(i, V) = l}. 

Theorem 6.1. For any open A 6 Mi (ft), 

liminf-^— logP+[i?„e A] >- inf yfc s(P~ , P+). 

"■ N \oV n \ a:P a eA 



n — >oo 



The aim of this section is to improve the lower bound by replacing the boxes by more general 
shapes in the two-dimensional case. The corresponding Shannon-McMillan theorems developed in 
Section [1] will be the key to the proof. For a closed curve c let int c be the subset of K 2 surrounded 
by c. Define the sets 

C a := {c | c : [0, T] — ► R 2 closed piecewise C -curve parametrized by arc (37) 
length, without self-intersections, and with areaintc = a}. 

Theorem 6.2. For any open A G Mi(£t), 



lim inf log P+Ti^ € A] > - inf inf i f ■ = =h c (p-,P^ 

— \dV n \ & 1 J " a -.p a eAcec a Aj ^1 + A(c'(t)) 2 V 



Remark 6.3. Replacing the class C a by squares with area a this bound coincides with the bound 
in Theorem 16.11 Let it be a square parametrized by arc length and with area int n = a. Then the 
length of every edge is y/a. For the two horizontal edges of the square the slope A (cf. (|32|) ) is with 
respect to the x-axis, and for the vertical edges it is with respect to the y-axis. Therefore, the 
integral equals 4 v / oT The entropy h n (P~ , P + ) equals s(P~,P + ), since the er-algebras "Po,* coincide 
with for the horizontal edges and with J 7 ^ for the vertical edges. 

Remark 6.4. In the case where the Markov property holds only with respect to the contour bound- 
ary we can state a bound similar to the one in Theorem l6.2l bv replacing the lattice approximation by 
the contour aproximation. The proof is essentially the same as for Theorem 16.21 (cf. pQ for details). 
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The rest of this section is devoted to the proof of Theorem 16.21 We will need some properties 
of the geometry of lattice approximations of polygons and their interplay with the random field. 
To begin with, we restate explicitly the global Markov property for random fields in the case when 
the conditioning is concentrated on a set of sites surrounded by a closed polygon tt without self- 
intersections. We use the notation T(c) := intclH Z 2 to indicate the set of lattice points surrounded 
by a closed curve c. By the definition ([2"T|) . d{l? \ r(-7r)) = L x , and the global Markov property 
(cf. Definition (|2.ip ). with V = Z 2 \T(tt), we have that for any Til? \ r(7r))-measurable nonnegative 
function 

E[4>\Fr M ]=E[4,\T L .]. (38) 

We will further need two lemmata that compute the asymptotic fractions of the lengths of the 
blowups of a line segment, or a polygon, and the sizes of their lattice approximations. 

Lemma 6.5. Let I be a real interval, l(x) = Xx + a be a linear function with slope A, and B k (k E N) 
be the sequence of its blowups restricted to I. If Lk is the lattice approximation of Bk then 

km 1^' 



fe^oc length^ Vl + A 2 

Proof. We consider only the case when < A < 1, that is, when the lattice approximation is 
given by L(z) — (z, [l(z)]) (z E I C\ Z). Other cases only differ in terms of notation. For any k € N, 
\Lk\ is either [length bk] or [length bk] + 1, where bk is the projection of Bk to the x-axis. We can 
ignore the second case, since the additional point does not matter for the limit. Observe that 
(length£? fc ) 2 = (length^) 2 + (A length bk) 2 - Consequently length bk = length £? fe / (Vl + A 2 ), which 
proves the convergence. □ 

Lemma 6.6. Let tt be a polygon with edges tt t (r = 1, . . . , R) and X r (r = 1, . . . , R) their slopes as 
defined in (I32p . Let B k TT (k E N) be the blowups of tt and Lkir (k E N) their lattice approximations. 
Then 

j. | L k n | _ 1 length n r 



length B k TT f-^ y/l + A 2 lengths 



Proof. Using length BkTT r — k length 7iy, we obtain 

| XfcTT ] ^ \Lkn\ length B k ir r ^ \L k Tr\ length ir r 



length BkiT ^-^ length BkiT r length BkTT length Bkir r lengths 



By the previous lemma applied to the individual sides, the first factors converge to l/yr+A 2 , 
which prove the statement of the lemma. □ 

Proof of Theorem IB. 21 : Let be < a < 1, such that P a E A. Since A is open, we can choose 
open neigborhoods A~ and A + of P~ respectively P + in A^i(fi) such that aA~ + (1 — a)A + C A. 
Without loss of generality we may assume that A~ and A + are in !Fv p for some p E N. Define the set 
Il a := {tt J tt closed polygon without self-intersections, areaint7r = a}. Let tt E Il4 Q with E int7r, 
and let (B n 7r)„gN be the sequence of blowups of tt. For a — 1 take C n :— V n . Otherwise, define 

C n := r(B fe( „)7r) and D n := V n \ T(Bi (n) Tr), 

n — >oc 

where k(n) and l(n) are chosen such that k(n) < l{n), l(n) — k(n) > oo, 

lim | " ) = a and lira ■ . " j = 1 — a. (39) 

»woo I V n | n^oc V n 
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To see that such sequences exist we show that k(n) = [-^/ci!|V^|/areaint tt ] and l(n) = \k(n) + y/n] 
fulfill the conditions. Obviously, both the sequences and their difference tend to infinity as n goes 
to infinity. Using areaint (Pfc7r) = fc 2 areaint7r, 

\T(B k n)\ ^cc x ^ ip^n-^ a 



areaint B^ir \V n \ areaint7r 

We obtain for the first expression in 

\C n \ areaint (B fc („)7r) k (n) 2 areaint tt 
hm = hm j — — — = hm : : = a. 

n->oo | V n | n^oc \ V n \ n^oo \ V n \ 

Similarly we see for the second expression in (|39|) . 

\D n \ \T{Bu n ))\ , ,. I (n) 2 area hit tt 
hm 7TT7 = 1 ~ hm — FtTi — = 1 ~ hm \TF~\ ' 

By definition of l(n), l(n) 2 = k(n) 2 + [2k(n)^/n\ + n. The last two addends are of order n and will 
go to when divided by | V n |. It remains to study k(n) 2 areaint tt / | V n |, but we already know from 
the second statement in (|40| that this converges to a. 



Define 



where C„ :P := V k ^_ p and £>„ !P := V n - P \ Vj(„) +p . Then {P„ G A } G .F Cn , {P+ G A+} G J 7 ^, for 
large enough n, {R n £ A} {R~ G A - } n € A+} := A„. Define the measures 

Q n = P Cn <g> P^\ Cn (n G N). 

Qn coincides with P~ on Tc n and with P + on To n , and makes these er-fields independent. Thus 
we obtain Q„[A„] = P~[R~ £ A~] P + [P+ G A + ], and by the ergodic behaviour of P~ and P + , the 
sequence Qn[A„] (n £ N) converges to 1. Let 4>n denote the density of Q n with respect to P + on 
J(7„ud„- Then for 7 > 0, e > 0, and for large enough u, 

P + [Rn G A] > P+[A n ] > /" 1 A </>„ rfQ ■n 

> exp(-( 7 + e) Q n [A„ n {\dVn\~ 1 lo g 0„ < 7 + e}] . 

By the convergence of the Q n [A n ], the lower bound liminf„_> 00 | c?V^ | ~ 1 logP + [P„ G A] > —7 follows 
if 7 is chosen such that, for any e > 0, 

lim Q„[|9F„|- 1 log^< 7 + e] = 1. (41) 

n — >oo 

We will show that (|4*Tjl holds with 

1 length 7r. r 



7 = y , & r k{p-, p +). 



Since Q„ = P + on £)„, and the fact that both P and P + are Gibbs measures with respect to the 
same potential we obtain 

P-[wcJ-P + [^J P-[wo n ] 

4>n{U) ------ 



^ + [WC„UD„] -P [wc„|Wu n ]' 
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Let L n be the lattice approximation of B n TT. By 



P [Wc„ I = P [cJo n | UC„] 



P-[uc n ] 



and thus 



P-r i i p "hJ p-r i i p [^m] 

= "p - ? i V = T5T7 i v ( 42 ) 

Going around the R sides of Lu n \, and conditioning site by site as in the proof of Theorem 14. 2[ we 
obtain 

where the \Jj( r ) corresponds to the r-th side of the polygon. Similar to the calculation between (f5TT|) 
to BU) we obtain 



U %, t „(2„)> 



i=0 



where A is the slope of the rth side of the polygon, t n and r n are as in Subsection 14.21 and Z n ^ >t 
^n,i,t — Yn,i,t> with 

X n ,i, t =logP - (w(0,0) | to(L[ T ,„ +I({tn}) (-1, . . . , -i) U A n ,i, t )), 
logP+ (o;(0, 0) | w (L* iT ^ +1({U) (-l, . . . , -i) U B n , iit )). 



and Y, 



To simplify notation we have omitted the index r. For the sets in the conditional expectations we 
have A n> i t t C B n: i, t Cl 2 \ (H n — L\j n {z n + i)). A n> i > t is obtained by shifting a subset of L n C C n . 
H n is constructed as in the paragraph above, but using the minimum of the diameter d n and the 
distance l(n) — k(n) in place of d n . 

To prove convergence, we study the X and F-parts separately. Because of the way the sets A n) i t t 
are constructed, the behavior of X n ^^ under Q n is the same as under P~ . But the proof of Theorem 
n shows that 



i=0 



converges to —h w (P ) in L 1 (P ). The convergence remains true when we replace X n ^, t by 



logP - (w(0, 0) | uj(L XtT * n+i{{tn}) (-l, -i)) ) , 



where, for a subset L of Z 2 , the element w(L) equals w on I and assumes the minimal state in T 
outside of H n — L\ t t n (z n + i). To control the behavior of Y n ^ tt under Q n define Z~ { t = X~ { t — Y n ,i,t- 
Use the law of large numbers for martingales with bounded increments in its L 2 -form to replace 



1 Un 
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by 



1 «„ 

— 7| H E [ Z n^,t ° ^, T f» C{t „})W ° 6 Lx,t n {*n) I AHi,t], 
fc(n) I ,; = n 



where -4 n ,i.t is the cr-field generated by the sites in D n and those sites of which precede i in the 
canonical ordering of These conditional expectations can be written as the relative entropy 

H (y, n), with the random measures 

/i(u) :=P - [ ■ |w(£ Ajt .„-n ({tB}) (-l, ■ ■ • ,-*))"] 
and v{uj):=P^[- | w(i A)T »»+i ({tn}) (-l, . . . , — i) U P n ,i,t)] . 

Now we want to replace /i by a measure 77 for which 



\L 



(-1, . . . , -i) I)" 1 £ fffo r?) o e Lx }) (i) o La 



(43) 



converges to /i A (P ,P + ), fnL (P ), as n goes to infinity. Define uj(L) + in analogy to u>(L) .Since 
for all u>, 



>(L Xtt (-l, ■■;-*))' 



Pn 



P\, 



we obtain (j4H|) by taking 



77(0;) 



Pn 



By Lemma 15.21 H(v(u>), fi(u>)) < H(i/(u>),r](LL>)). Summing over r = 1, . . 
convergence in L 1 (P _ ) to stochastic convergence with respect to Q n yields 

lim Q n [\L k(n) \- 1 (f> n >h n (p-,P + )+e] = 

n— >oc 

for any e > 0. To derive (j4"T|) it remains to show that 

length 7r r 



, R, and passing from 



lim 

n->oo I I 



y L 



^VTTAf 8 
The fraction on the left-hand side can be written as a product: 



(44) 



J fc(n) 



L 



k(r. 



k(n) length 7r 



\dV n \ length B k{n) 7T \3V n \ 

We first study the asymptotics of the second factor: As we are only interested in the limit behaviour, 
we can drop the brackets for the integer part in the definition of k(n) and just use y a\ V n |/ areaint tt. 
The denominator equals 4 1 V n |. Reducing the fraction yields 1/8. The limit of the first factor in ([6|) 
was computed in Lemma 16.61 So, the second factor in ^ converges to length 7r/8. Combining all 
yields (|i4). 

Now, we replace the polygon 7r by the polygon n = B 1 / 2 tt- Since length 7r r = length n r /2, and 
since, by Corollary E51^(P- , P + ) = K{P~,P + ), 



7 



1 length 7T r 



%(P-,P+). 



^ v/t+ai 4 

Finally, by Lemma 13.41 the infimum of that function over all polygons tt 6 IIq, equals the infimum 
over all curves c £ C a . □ 
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